172 research outputs found

    Reducing the burden of parallel loop schedulers for many-core processors

    Get PDF
    Funder: FP7 People: Marie‐Curie Actions; Id: http://dx.doi.org/10.13039/100011264; Grant(s): 327744Summary: As core counts in processors increases, it becomes harder to schedule and distribute work in a timely and scalable manner. This article enhances the scalability of parallel loop schedulers by specializing schedulers for fine‐grain loops. We propose a low‐overhead work distribution mechanism for a static scheduler that uses no atomic operations. We integrate our static scheduler with the Intel OpenMP and Cilkplus parallel task schedulers to build hybrid schedulers. Compiler support enables efficient reductions for Cilk, without changing the programming interface of Cilk reducers. Detailed, quantitative measurements demonstrate that our techniques achieve scalable performance on a 48‐core machine and the scheduling overhead is 43% lower than Intel OpenMP and 12.1× lower than Cilk. We demonstrate consistent performance improvements on a range of HPC and data analytics codes. Performance gains are more important as loops become finer‐grain and thread counts increase. We observe consistently 16%–30% speedup on 48 threads, with a peak of 2.8× speedup

    The role of response modalities in cognitive task representations

    Get PDF
    The execution of a task necessitates the use of a specific response modality. We examined the role of different response modalities by using a task-switching paradigm. In Experiment 1, subjects switched between two numerical judgments, whereas response modality (vocal vs. manual vs. foot responses) was manipulated between groups. We found judgment-shift costs in each group, that is irrespective of the response modality. In Experiment 2, subjects switched between response modalities (vocal vs. manual, vocal vs. foot, or manual vs. foot). We observed response-modality shift costs that were comparable in all groups. In sum, the experiments suggest that the response modality (combination) does not affect switching per se. Yet, modality-shift costs occur when subjects switch between response modalities. Thus, we suppose that modality-shift costs are not due to a purely motor-related mechanisms but rather emerge from a general switching process. Consequently, the response modality has to be considered as a cognitive component in models of task switching

    Accelerated large-scale multiple sequence alignment

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Multiple sequence alignment (MSA) is a fundamental analysis method used in bioinformatics and many comparative genomic applications. Prior MSA acceleration attempts with reconfigurable computing have only addressed the first stage of progressive alignment and consequently exhibit performance limitations according to Amdahl's Law. This work is the first known to accelerate the third stage of progressive alignment on reconfigurable hardware.</p> <p>Results</p> <p>We reduce subgroups of aligned sequences into discrete profiles before they are pairwise aligned on the accelerator. Using an FPGA accelerator, an overall speedup of up to 150 has been demonstrated on a large data set when compared to a 2.4 GHz Core2 processor.</p> <p>Conclusions</p> <p>Our parallel algorithm and architecture accelerates large-scale MSA with reconfigurable computing and allows researchers to solve the larger problems that confront biologists today. Program source is available from <url>http://dna.cs.byu.edu/msa/</url>.</p

    Commissioning and operation of the readout system for the solid neutrino detector

    Get PDF
    The SoLid experiment aims to measure neutrino oscillation at a baseline of 6.4 m from the BR2 nuclear reactor in Belgium. Anti-neutrinos interact via inverse beta decay (IBD), resulting in a positron and neutron signal that are correlated in time and space. The detector operates in a surface building, with modest shielding, and relies on extremely efficient online rejection of backgrounds in order to identify these interactions. A novel detector design has been developed using 12800 5 cm cubes for high segmentation. Each cube is formed of a sandwich of two scintillators, PVT and 6LiF:ZnS(Ag), allowing the detection and identification of positrons and neutrons respectively. The active volume of the detector is an array of cubes measuring 80x80x250 cm (corresponding to a fiducial mass of 1.6 T), which is read out in layers using two dimensional arrays of wavelength shifting fibres and silicon photomultipliers, for a total of 3200 readout channels. Signals are recorded with 14 bit resolution, and at 40 MHz sampling frequency, for a total raw data rate of over 2 Tbit/s. In this paper, we describe a novel readout and trigger system built for the experiment, that satisfies requirements on: compactness, low power, high performance, and very low cost per channel. The system uses a combination of high price-performance FPGAs with a gigabit Ethernet based readout system, and its total power consumption is under 1 kW. The use of zero suppression techniques, combined with pulse shape discrimination trigger algorithms to detect neutrons, results in an online data reduction factor of around 10000. The neutron trigger is combined with a large per-channel history time buffer, allowing for unbiased positron detection. The system was commissioned in late 2017, with successful physics data taking established in early 2018

    SoLid: A short baseline reactor neutrino experiment

    Get PDF
    The SoLid experiment, short for Search for Oscillations with a Lithium-6 detector, is a new generation neutrino experiment which tries to address the key challenges for high precision reactor neutrino measurements at very short distances from a reactor core and with little or no overburden. The primary goal of the SoLid experiment is to perform a precise measurement of the electron antineutrino energy spectrum and flux and to search for very short distance neutrino oscillations as a probe of eV-scale sterile neutrinos. This paper describes the SoLid detection principle, the mechanical design and the construction of the detector. It then reports on the installation and commissioning on site near the BR2 reactor, Belgium, and finally highlights its performance in terms of detector response and calibration
    • 

    corecore